Querying Semistructured Heterogeneous Information

نویسندگان

  • Dallan Quass
  • Anand Rajaraman
  • Yehoshua Sagiv
  • Jeffrey D. Ullman
  • Jennifer Widom
چکیده

Semistructured data has no absolute schema xed in advance and its structure may be irregular or incomplete. Such data commonly arises in sources that do not impose a rigid structure (such as the World-Wide Web) and when data is combined from several heterogeneous sources. Data models and query languages designed for well structured data are inappropriate in such environments. Starting with a \lightweight" object model adopted for the TSIMMIS project at Stanford, in this paper we describe a query language and object repository designed speci cally for semistructured data. Our language provides meaningful query results in cases where conventional models and languages do not: when some data is absent, when data does not have regular structure, when similar concepts are represented using di erent types, when heterogeneous sets are present, and when object structure is not fully known. This paper motivates the key concepts behind our approach, describes the language through a series of examples (a complete semantics is available in an accompanying technical report [QRS94]), and describes the basic architecture and query processing strategy of the \lightweight" object repository we have developed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SINGAPORE: A system for querying heterogeneous data sources

SINGAPORE (SINGle Access POint for heterogeneous data REposiotries) is a system for querying data sources which are structurally heterogeneous, i.e. they contain structured, semistructured and/or unstructured data. Its main focus is on o ering a uni ed query language, so that retrieving information can be supported by the traditional task of database querying, but also by a more vague or \fuzzy...

متن کامل

Integration of Heterogeneous Semistructured Data Models in the Canonical One

To provide for interoperability of heterogeneous information objects it is required to establish a global, uniform view of the underlying digital collections and services. An information model is needed which is able to express uniformly the structure and semantics of heterogeneous data collections as well as the available services. Usually the mediator's layer is introduced to provide the user...

متن کامل

Managing Semistructured Data with FLORID: A Deductive Object-Oriented Perspective

| The closely related research areas management of semistructured data and languages for querying the Web have recently attracted a lot of interest. We argue that languages supporting deduction and object-orientation (dood languages) are particularly suited in this context: Objectorientation provides a exible common data model for combining information from heterogeneous sources and for handlin...

متن کامل

Querying Semistructured Temporal Data

In this paper we propose the GEM Language (GEL), a SQLlike query language, which is able to extract information from semistructured temporal databases represented according to the Graphical sEmistructured teMporal (GEM) data model.

متن کامل

(Modal) Logics for Semistructed Data

The area of semistructured data includes collections of data items which have in some ways similar but not identical structure. Examples of semistructured data range from heterogeneous databases to the World Wide Web Abi97]. The area is obviously quite heterogeneous itself. However there are some important features common to all kinds of semistructured data, namely: data is represented as an ed...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995